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O " «A SEQUENCE LISTING 

JUL 2 D 1998 ' * 

(1) GENERAL INFORMATION: 

v , .(X) APPLICANT: Muller-Rober, Bernd 

j." ' * Land- Schutze, Volker 

La Cognata, Ursula 

(ii) TITLE OF INVENTION : PROCESSES FOR INHIBITING AND FC 
INDUCING FLOWER FORMATION IN PLANTS 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : FISH & NEAVE 

(B) STREET: 1251 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10020 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DC-S/MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.3 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/702,718 

(B) FILING DATE: 19-MAR-1997 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P4408629.6 

(B) FILING DATE: 09-MAR-1994 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P4435366.9 

(B) FILING DATE: 22-SEP-1994 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P4 43 8 82 1.7 

(B) FILING DATE: 19-OCT-1994 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Haley Jr., James F. 

(B) REGISTRATION NUMBER: 27,794 

(C) REFERENCE/DOCKET NUMBER: AGREVO-1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 212-59 6-9000 

(B) TELEFAX : 212-596-9090 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1891 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solatium tuberosum 

(B) STRAIN: c.v. Desiree 
(F) TISSUE TYPE: leaf 



(vii) IMMEDIATE SOURCE: . 

(A) LIBRARY: cDNA library in pBluescnptKS 

(B) CLONE: pCBS 



(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 1485 

(D) OTHER INFORMATION: /EC_number- 4.1.3. 
/product= "Citrate synthase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TTTTTCGTTC CAT CAGCCTA CTTGAGATGT ATTCCCACTG GT AAAAGT T A ATTTTTTTGA 

TTTTCGCGAG CA ATG GTG TTC TAC CGT AGC GTT TCG TTG CTG TCA AAG 
TTTTCGCGAG CA ^ ^ ^ ^ ^ ^ ^ Leu Leu Ser Lys 

1 5 10 

CTC CGC TCT CGA GCG GTC CAA CAG TCA AAT GTT AGC AAT TCT GTG CGC 
Leu S Ser Arg Ala Val Gin Gin Ser Asn Val Ser Asn Ser Val Arg 
15 20 25 

TGG CTT CAA GTC CAA ACC TCT TCC GGT CTT GAT CTG CGT TCT GAG CTG 
Irt III VaT Gin Thr Ser Ser Gly Leu Asp Leu Arg Ser Glu Leu 



30 



35 



CTA CAA GAA TTG ATT CCT GAA CAA CAG GAT CGC CTG AAA AAG ATC AAG 
vll ctS gJS llu lie Pro Glu Gin Gin Asp Arg Leu Lys Lys He Lys 



45 50 



TCA GAT ATG AAA GGT TCA ATT GGG AAC ATC ACA GTT GAT ATG GTT CTT 
Ser Sp Se? i£ Gly Ser He Gly Asn lie Thr Val Asp Met Val Leu 



65 



GGT GGA ATG AGA GGA ATG ACA GGA TTA CTG TGG AAA CCT CAT TAC CTT 
Gly Gly Met Arg Gly Met Thr Gly Leu Leu Tr P Lys Pro His Tyr Leu 
80 85 

GAC CCT GAT GAG GGA ATT CGC TTC CGG GGG TTG TCT ATA CCT GAA TGC 
Sp Pro Sp Glu Gly He Arg Phe Arg Gly Leu Ser lie Pro Glu Cys 
K 95 100 105 

CAA AAG GTA TTA CCT GCA GCA AAG CCT GGG GGT GAG CCC TTG CCT GAA 
ijs Sal Leu Pro Ala Ala Lys Pro Gly Gly Glu Pro Leu Pro Glu 
110 H5 120 

CTT CTC TGG CTT CTT TTA ACA GGA AAG GTG CCA TCA AAA GAG CAA 
Gly III Leu lr P Leu Leu Leu Thr Gly Lys Val Pro Ser Lys Glu Gin 



60 
108 

156 

204 

252 

300 

348 

396 

444 

492 



125 



130 



54 



GTG AAT TCA ATT GTC TCA GGA ATT OCA GAG TOG GGC ATC ATA TCC CTG 
Val Asn Ser He Val Ser Gly He Ala Glu Ser Gly ^ 



145 150 



S S S E s e s s s e s e s s e s 



160 165 



s s s e s s s e k s e a i ss s e 

S E S S E E E 5 5 E - E E - S S 
S E E E 5 E S E 2 S E E SI E E E 

210 

TcT OCT TAT GTT TAT CGC AGG ATG TAC AAG AAT GGT GAC ACT ATA COT 
Ala Ala Tyr Val Tyr Arg Arg Met Tyr Lys Asn Gly P ^ 

E S E E S S E E S E 5 E SS E S E 
E E E E K S E E E E E 5 S E !S E 

e 5 e s e E E e e e e e e e e 

E E E S E E E E S E 2 E E E E E 
E E E E E E E E S E E E E E E E 

305 31U 



E E E S E S E E E E E S E E E E 



320 325 



E S E E E E E E E E S S E E E E 



335 340 



E E E E E E E E E E E E E E E E 



350 3 " 



TAT ACA TGC CAG AGA GAG TTC GOT ATG AAG CAT TTG CCT GAA GAT CCA 
Tyr Thr Cys Gin Arg Glu Phe Ala Met Lys His Leu Fro y ^ 

365 370 37 



540 



588 



636 



684 



731 



780 



828 



876 



924 



972 



1020 



1068 



1116 



1164 



1212 
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S E S S S E S S K JS S S S S S 



385 



SK E S S 5! S K K i S SS S S 2S K 



400 



sj s ss s: is - s ™ - - - i - ™ - 

415 420 

S 2S S S Sf s s s s ss s ss 2 s s s 

4 35 Qhu 

1% ~ S E 2S 5 s s s s: i ss 3: = E s 

445 450 

A TG GAG TGG CTT GAG AAC CAG TGC AAG AAA GCA TGAATTGTTT GAAATCTCGC 
Met Glu Trp Leu Glu Asn Gin Cys Lys Lys Ala 



465 



GAGC AT AAAA C ACAAT GT AT AATCTCTATG AATAATTGCT T GACAAAGC A CTCCTTTCTT 
GGGGGACAAG ATAGGTCGGC CCTTCAATGG GTTAACGAAC TTCAGTTCAA ACTTCACTGA 
ATTTGTGTGA AT T GT AT GGT TTCTCGAGAC TTGTCCTGAA TTTTGAACTT AGTCTAGTGG 
ATTCATTTTT CTTCATTCCG AATTCCTCAC ACGCTGATCC AGC AT GTAAA AATTAATAGG 
TCAATGCTAT TAATCGCGTT CTTGGTTGCC ATTAGACTTG T GAAT GACT T CCTTTGCTGG 
AAAGTTAGTA ATCGGCTGAT TCACGCAATA AACTGCAATT GT GT AGT T T C TTAAATTTGC 
TAATTCTTAT T T GAT GAT AT TATGAA 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
Met Val Phe Tyr Arg Ser Val Ser Leu Leu Ser Lys Leu Arg Ser Arg 

1 5 10 

Ala Val Gin Gin Ser Asn Val Ser Asn Ser Val Arg Trp Leu Gin Val 

20 25 
Gin Thr Ser Ser Gly Leu Asp Leu Arg Ser Glu Leu Val Gin Glu Leu 

ACS 4 3 



lie Pro Glu Gin Gin Asp Arg Leu Lys Lys He Lys Ser Asp Met Lys 
50 " 60 



1260 

1308 

1356 

1404 

1452 

1505 

1565 
1625 
1685 
1745 
1805 
1865 
1891 



56 - 



Gly ser lie Gly Asn He Thr val Asp Met Val Leu Gly Gly Met Arg 

65 7 ^ 

85 90 
Gly Xle Arg Phe Arg Gly Leu Ser lie Pro Glu Cys Gin Lys Val Leu 



100 



Pro Gly Gly Glu Pro Leu Pro Glu Gly Leu Leu Trp 
120 



Pro Ala Ala Lys Pro G±y - — ^ 

115 1 



L eu Leu Leu Thr Gly Lys Val Pro Ser Lys Glu Gin Val Asn Ser He 

130 135 
val ser Gly He Ala Glu Ser Gly He He Ser Leu He He Met Tyr 

145 150 -LOD 

Thr Thr lie Asp Ala Leu Pro Val Thr Ala Hi- Pro Met Thr Gin Phe 

165 1/U 

Ala Thr Gly Val Met Ala Leu Gin Val Gin Ser Glu Phe Gin Lys Ala 

180 iy:3 

T yr Glu Lys Gly He Hi, Lys Ser Lys Tyr Trp Glu Pro Thr Tyr Glu 

195 200 

Asp ser Met Asn Leu He Ala Gin Val Pro Leu Val Ala Ala Tyr Val 

210 215 
T yr Arg Arg Met Tyr Lys Asn Gly Asp Thr He Pro Lys Asp Glu ser 



225 



230 



L eu Asp Tyr Gly Ala Asn Phe Ala His Met Leu Gly Phe Ser Ser Ser 

O A ^ Z DU 



245 



Glu Met His Glu Leu Leu Met Arg Leu Tyr Val Thr He Hi. Ser Asp 

260 2b ^ 
His Glu Gly Gly Asn Val Ser Ala His Thr Gly His Leu Val Ala Ser 

275 280 



Ala Leu Ser Asp 
290 



Pro Tyr Leu Ser Phe 



Ala Ala Ala Leu Asn Gly Leu 



295 



300 



Ala Gly Pro Leu His Gly Leu Ala As 



n Gin Glu Val Leu Leu Trp He 



305 



310 



315 



320 



Lys Ser Val Val Glu Glu Cys Gly Glu Asn He Ser Lys Glu Gin Leu 

325 330 

L ys Asp Tyr Val Trp Lys Thr Leu Asn Ser Gly Lys Val Val Pro Gly 

340 34:3 
Phe Gly His Gly Val Leu Arg Lys Thr Val Pro Arg Tyr Thr cys Gin 

355 360 
Arg Glu Phe Ala Met Lys His Leu Pro Glu Asp Pro Leu Phe Gin Leu 
370 375 
1 ser Lys Leu Tyr Glu Val Phe Leu Leu Phe Leu Gin Asn Leu Ala 



Va 
385 



390 
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Lys Leu Lys Pro Trp Pro Asn Val Asp Ala His Ser Gly Val Leu Leu 

405 410 

Asn Tyr Tyr Gly Leu Thr Glu Ala Arg Tyr Tyr Thr Val Leu Phe Gly 

420 4 " 
Val Ser Arg Ala Leu Gly He cys Ser Gin Leu He Trp Asp Arg Ala 

L eu Gly Leu Pro Leu Glu Arg Pro Lys Ser Val Thr Met Glu Trp Leu 

450 455 

Glu Asn Gin Cys Lys Lys Ala 
465 470 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Beta vulgaris 

(B) STRAIN: Zuchtlmie 5S 0026 
(F) TISSUE TYPE: leaf 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: pSBCS 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1311 

(D) OTHER INFORMATION: /EC_number- 4.1..*./. 
/product^ "citrate synthase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TCC TCT AAC CTT GAC CTT CGT TCA GAG TTA CAA 
Ser Ser Asn Leu Asp Leu Arg Ser Glu Leu Gin 
1 5 io 

CAA CAG GAA CGA CTG AAG AAG ATA AAG AAA GAA 
Gin Gin Glu Arg Leu Lys Lys lie Lys Lys Glu 
20 25 

CTG GGG AAT ATC AAT GTT GAC ATG GTA TTG GGC 
Leu Gly Asn He Asn Val Asp Met Val Leu Gly 
35 40 

ACT GGT TTA CTT TGG GAG ACT TCC TTA CTC GAC 
Thr Gly Leu Leu Trp Glu Thr Ser Leu Leu Asp 
50 55 

CGG TTC AGG GGT TTT TCT ATA CCT GAA TGC CAG 
Arg Phe Arg Gly Phe Ser He Pro Glu Cys Gin 
65 70 75 



GAA CTG ATT 
Glu Leu He 



TTT GGA AGT 
Phe Gly Ser 
30 

GGA ATG AGA 
Gly Met Arg 
45 

CCA GAA GAG 
Pro Glu Glu 
60 

AAA CTT TTA 
Lys Leu Leu 



CCT GAA 
Pro Glu 
15 

TTC CAG 
Phe Gin 



GGA ATG 
Gly Met 



GGT ATC 
Gly He 



CCC GCT 
Pro Ala 
80 



48 



96 



144 



192 



240 



58 - 



S 5= S SS S K S S £ K 5 S S 5 S S 
S S 5 iS vll S E K i SK v2 S S S S S 
£ ™ S i» 25 S E f 5 S E S£ ?J E ^ S 
5 S 25 S S £ S S S K S S! 3 5 S S 
!S K s s sk S S iS Si S Sj JS S 5 SI K 

15 0 J- 33 

? y s s 3: e s = - =s i - " s: s i s 

165 

»* r s s ss s fj 511 s £ - ™ m s 

Ser Leu lie Ala Gin vai rru 19Q 
180 ltf:3 

„. TAT »« »T « ™ -J CC. CT= JJT «C TCC CTT CAT x» 

Met Tyr Lys Asn Gly Gin Val lie Pro i,eu P v 2Qb 

r, s ss s s s k s s s s s s s: s s 

S K S 5 S SS S S S SS i 5 S Si 5 |i 

225 230 2 

r-^ nrT rrc CAT TTG GTG GGT AGT CCA CTT TCA GAT 

^ SS if. S SS S S SK »« VJ1 «y ~r «o ,.u .« «P 

245 250 



- £ f ; i S SS SK Si 21 - SS SS S i - - 
S S S S E SS Si i 2 S 5 S K S - !S 
S K 5 S Si SS i S S S S S S S 

s 5 s s « n = s 3: v-s -i s s s s s i 

305 310 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



- 59 - 

CTA TTG CGG AAG ACA GAT CCA AGA TAC ACA TGC CAA AGA GAA TTT GCG 
?£ llu Arg Tys Thr Asp Pro Arg Tyr Thr Cys Gin Arg Glu Phe Ala 
325 330 

TTG AAG CAC TTG CCT GAT GAC CCA TTT TTT CAA TTG GTG TCA AAG TTG 
Leu Lys His Leu Pro Asp Asp Pro Phe Phe Gin Leu Val Ser Lys Leu 



TAT GAA GTG GTG CCT CCT ATT CTA TTA GAG CTT GGA AAG GTA AAG AAT 
Tyl G^u" vll val Pro Pro He Leu Leu Glu Leu Gly Lys Val Lys Asn 
Y 355 3 60 365 

rrn TGG CCT AAT GTT GAT GCT CAT AGT GGA GTT TTG CTG AAC CAC TAT 
Tro Trp Pro ^l Val Asp Ala His Ser Gly Val Leu Leu Asn His Tyr 
370 3 "?5 380 

GGT TTG ACA GAA GCA AGA TAC TAT ACG GTT TTG TTT GGG GTA TCA AGG 
Gly Leu Thr Glu Ala Arg Tyr Tyr Thr Val Leu Phe Gly Val Ser Arg 



385 



390 



AGT CTT GGA ATA TGC TCA CAG CTT ATA TGG GAC CGA GCT CTT GGC TTG 
sir Leu Gly He Cys Ser Gin Leu He Trp Asp Arg Ala Leu Gly Leu 
405 410 

CCG CTA GAG AGG CCA AAG AGT GTC ACT ATG GAA TGG CTT GAA AAG TTT 
Pro Leu Glu Arg Pro Lys Ser Val Thr Met Glu Trp Leu Glu Lys Phe 
420 425 430 

TGT AAA AGA AGA GCA TA AC AT T GAT GA CATATCAACT CACTGTTGTT 

Cys Lys Arg Arg Ala 
435 

CTTTGTCGAA TCTACAATAA TAT AGT T T GA G GGA CAA GAA AG AAT TTT AT T T T C GGA GAT 
GAGAT AAGC G AGGACTCAGA AAC AT AGT T T TCTTTGTCTC TTGCTGAGGT TTGCGTTTTA 
TATATTTCAC TTGTAAATAT AT T GT AT GGT TT CTT GAT CA AAAC AT GAGA T AAAGAGT T T 
TCATAAAAAA AAAAAAAAAA AAAAAAAA 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Ser Ser Asn Leu Asp Leu Arg Ser Glu Leu Gin Glu Leu He Pro Glu 

1 5 10 

Gin Gin Glu Arg Leu Lys Lys He Lys Lys Glu Phe Gly Ser Phe Gin 



20 25 

Leu Gly Asn He Asn Val Asp Met Val Leu Gly Gly Met Arg Gly Met 
35 40 45 



1008 

1056 

1104 

1152 

1200 

1248 

1296 

1343 

1403 
1463 
1523 
1551 
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50 55 



Arg Phe Arg Gly Phe Ser He Pro Glu Cys . qo 

65 70 



Gin Lys Leu Leu Pro Ala 



Ala Ser Ala Gly Ala Glu Pro Leu P --- . g5 



ro Glu Gly Leu Leu Trp Leu Leu 



85 



90 



L eu Thr Gly Lys Val Pro Ser Lys Glu Gin Val Asp Ala Leu Ser Ala 

100 1Ub 
ASP Leu Arg Lys Arg Ala Ser lie Pro Asp Hi- Val Tyr Lys Thr He 

115 12U 
ASP Ala Leu Pro He Thr Ala His Pro Met Thr Gin Phe Cys Thr Gly 

130 135 
val Met Ala Leu Gin Thr Arg Ser Glu Phe Gin Lys Ala Tyr Glu Lys 
145 150 

Gly lie His Lys Ser Lys Phe Trp Glu Pro Thr Tyr Glu Asp Cys Leu 

165 1/u 
ser Leu He Ala Gin Val Pro Val Val Ala Ala Tyr Val Tyr Arg Arg 

180 

Met Tyr Lys Asn Gly Gin Val He Pro Leu Asp Asp Ser Leu Asp Tyr 

195 zuu 
Gly Gly Asn Phe Ala His Met Leu Gly Phe Asp Ser Pro Gin Met Leu 

210 215 
Glu Leu Met Arg Leu Tyr Val Thr He His Ser Asp His Glu Gly Gly 
225 230 

„ val ser Ala His Thr Gly His Leu Val Gly Ser Pro Leu Ser Asp 



As 



245 



Pro Tyr Leu Ser Phe Ala Ala Ala Leu Asn Gly Leu Ala Gly Pro Leu 

260 26b 



Gly Leu Ala Asn Gin Glu Val Leu Leu Trp He Lys Ser Val Val 

275 280 
Glu cys Gly Glu Asn He Ser Thr Glu Gin Leu Lys Asp Tyr Val 
290 295 
Trp Lys Thr Leu Asn Ser Gly Lys Val Val Pro Gly Phe Gly Leu Gly 
o n c: 310 



val Leu Arg Lys Thr Asp Pro Arg Tyr Thr Cys Gin Arg Glu Phe Ala 

Leu Lys His Leu Pro Asp Asp Pro Phe Phe Gin Leu Val Ser Lys Leu 

340 34 ^ 

Tyr Glu Val Val Pro Pro He Leu Leu Glu Leu Gly Lys Val Lys Asn 

355 360 

r i t\ H ic ser Glv Val Leu Leu Asn Hxs Tyr 

Pro Trp Pro Asn Val Asp Ala His ber ^±y ^ 

370 375 



61 



Gly Leu Thr Glu Ala Arg Tyr Tyr Thr Val Leu Phe Gly V.l Ser Arg 

,t 390 Jy:=) 



385 



ser Leu Gly He Cys Ser Gin Leu He Trp Asp Arg Ala Leu Gly Leu 

405 41U 
Pro Leu Glu Arg Pro Lys Ser Val Thr Met Glu Trp Leu Glu Lys Phe 



420 



Cys Lys Arg Arg Ala 
435 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Nicotiana tabacum 
(F) TISSUE TYPE: leaf 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: TCS 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 70.. 1476 

(D) OTHER INFORMATION: /EC_number- 4.1. 
/product^ "citrate synthase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GCTCTTGGGA TCTATTTCCT CTCTCTATTT CTCCCTAGGT AAAAGTTAAT TTGTTGATTT 

TTGCGAGCC ATG GTG TTC TAT CGC GGC GTT TCT CTG CTG TCA AAG CTG 
TTGCGAGCC AT^ ^ ^ ^ ^ ^ ^ ^ ^ Ser Lys Leu 

1 5 10 

TCT TCT CGA GCG GTC CAA CAG ACA AAT CTT AGC AAC TCT GTG CGG TGG 
Srg Ser Arg Sa Val Gin Gin Thr Asn Leu Ser Asn Ser Val Arg Trp 
15 20 25 

CTT CAA GTC CAA ACC TCT TCT GGT CTT GAT CTG CGT TCT GAG CTG CAA 
III vll Gin Thr Ser Ser Gly Leu Asp Leu Arg Ser Glu Leu Gin 

30 35 

s: SS S SS - ^ s s - E S5 % Si ss 

SI £ £ - S! leu SS ^ - £ v" S val s t » tz 

65 70 



60 
108 

156 

204 

252 

300 



62 - 



E e is si e e E ™ e e si e e - e 

80 85 

- e e e s e e e e e e e E - e si 

E E E £ S S E £ 5 5 E E 3 £ E E 

S E E E E E - E - i - - - - E E 
E E E IS E Si E E E E E E E E E E 

e - e S s E e e e e e e e e e e 



160 



165 



S E E E 5 S E E E E E 5= K E E 



175 



180 



E S 5 25 E i S E E S E S - Si £ E 

E E E E E E E E E E E £ £ E E S 

210 215 

TAT GTT TAT CGC AGG ATG TAC AAG AAC GGC AAC ACT ATA OCT AAG GAT 
Tyr Val Tyr Arg Arg Met Tyr Lys Asn Gly Asn Thr ixe ^ y 

225 230 
G AC TCA CTG GAT TAT GGT GCA AAT TTT GOT CAC ATG CTT GGT TTC AGT 
240 245 



E e e E E E e «; ™ «. - - «» s - 

e e E e e e e e E e E i E - ~ - 

255 260 2 

E S Si E E E - E E E E E E E E E 
5 SI £ E E E E S S i S E E S E E 

E E Si E S E E Si E 5i T „ E Si E E E E 



348 



396 



444 



492 



540 



588 



636 



684 



732 



780 



828 



876 



924 



972 



1020 



305 



63 



ATC AAA TCT GTT GTA GAG GAG TGT GGG GAG AAC ATT TCC AAA GAG CAG 
?Ie Ser Val Val Glu Glu Cys Gly Glu Asn He Ser Lys Glu Gin 

325 



320 



TTG AAA GAC TAG GCT TGG AAA ACA TTG AAA AGT GGC AAG GTT GTC CCT 
leu i£ Sp Tyr Ala Trp Lys Thr Leu Lys Ser Gly Lys Val Val Pro 

340 345 



335 



CCT TTC GGA CAT GGA GTT CTG CGC AAG ACT GAT CCA AGA TAC ACA TGC 
Ty HI Tv Ss Gly val Leu Arg Lys Thr Asp Pro Arg Tyr Thr Cys 

350 355 



CAG AGA GAG TTC GCT TTG AAG CAT TTG CCT GAA GAT CCA CTG TTT CAA 
Gin Arg Glu Phe Ala Leu Lys His Leu Pro Glu Asp Pro Leu Phe Gin 
370 375 

TTG GTT GCA AAA CTC TAC GAA GTG TTC CTC CAA TTC TTA CAG AAC TTG 
Su Val aS i£ Leu Tyr Glu Val Phe Leu Gin Phe Leu Gin Asn Leu 
385 3 9° 39b 



CCA AAG TTA AAC CCT TGG CCA AAT GTT GAT GCC CAC AGT GGT GTG TTG 
A S L^s Hn Sin Pro Trp Pro Asn Val Asp Ala His Ser Gly Val Leu 



400 



405 



TTG AAC TAT TAT GGT TTA ACT GAA GCA AGA TAT TAT ACG GTC CTC TTT 
Hu Sin Tyr ryr Gly Leu Thr Glu Ala Arg Tyr Tyr Thr Val Leu Phe 



415 



420 



GGT GTA TCA AGA GCT CTT GGC ATT TGC TCT CAG CTA ATT TGG GAC CGA 
Gly Val Ser Arg Ala Leu Gly He Cys Ser Gin Leu He Trp Asp Arg 



430 



435 



GCT CTT GGA TTG CCA CTA GAG AGG CCA AAG AGT GTC ACA ATG GAG TGG 
All Tel Ty Leu Pro Leu Glu Arg Pro Lys Ser Val Thr Met Glu Trp 

450 4bb 
CTT GAG AAC CAT TGC AAG AAA GCA TGATTTGTTT GAAATCTCTG C GAGC AT AAA 
Leu Glu Asn His Cys Lys Lys Ala 
465 

AGCACAATGT AAAAT CTTTA TGAATAATTG CTT GAGAAAG CAGTTTTTTC TTGGAGCCAA 
GGTAGGTCGC AT TAG GAT GT T CAT CGATT G GCTTAGTACG GTTTTGAAAG ATTTTGGTTG 
TGTATTTTCA GTTTCGGTTT TAAAAATGTT AT AC C AAT AC CT TAT C GAT A TAAATTCAAT 
AT GAT T C GAT TTTTTACTTT TGTTTGAAAA AAAAAACAAA AAAAAAAAAA AAAAAAAAAA 
A 



1068 

1116 

1164 

1212 

1260 

1308 

1356 

1404 

1452 

1506 

1566 
1626 
1686 
1746 
1747 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



64 - 

Ser Lys Leu Arg Ser Arg 



Met Val Phe Tyr Arg Gly Val Ser Leu Leu . ^ 

1 



5 1° 



Ala Val Gin Gin Thr Asn Leu Ser Asn Ser Val Arg Trp Leu Gin Val 

20 25 
Gin Thr Ser Ser Gly Leu Asp Leu Arg Ser Glu Leu Gin Glu Leu He 



35 



40 



Pro Glu Gin Gin Asp Arg Leu Lys Lys Leu Lys Ser Glu His Gly Lys 
50 55 60 

val Gin Leu Gly Asn lie Thr Val Asp Met Val Leu Gly Gly Met Arg 



65 



70 



Gly Met Thr Gly Leu Leu Trp Glu Thr Ser Leu Leu Asp Pro Asp Glu 
85 90 

Gly He Arg Phe Arg Gly Leu Ser He Tyr Glu Cys Gin Lys Val Leu 

100 105 

Pro Ala Ala Lys Pro Gly Gly Glu Pro Leu Pro Glu Gly Leu Leu Trp 
115 120 I* 5 

L eu Leu Leu Thr Gly Lys Val Pro Ser Lys Glu Gin Val Asp Ser Leu 

130 135 l^O 

Ser Gin Glu Leu Arg Ser Arg Ala Thr Val Pro Asp Hi, Val Tyr Lys 

145 1^0 155 

Thr lie Asp Ala Leu Pro Val Thr Ala His Pro Met Thr Gin Phe Ala 

165 i/u 

Thr Gly Val Met Ala Leu Gin Val Gin Ser Glu Phe Gin Lys Ala Tyr 

180 185 



Glu Lys Gly He His Lys Ser Lys 



Leu Trp Glu Pro Thr Tyr Glu Asp 



195 



200 



205 



Ser Met Ser Leu He Ala Gin Val Pro Leu Val Ala Ala Tyr Val Tyr 
210 215 220 

Arg Arg Met Tyr Lys Asn Gly Asn Thr He Pro Lys Asp Asp Ser Leu 
225 230 23b 

Asp Tyr Gly Ala Asn Phe Ala His Met Leu Gly Phe Ser Ser Ser Asp 

245 2bU 

Met His Glu Leu Met Lys Leu Tyr Val Thr He His Ser Asp His Glu 

260 265 
Gly Gly Asn val Ser Ala His Thr Gly His Leu Val Ala Ser Ala Leu 

275 280 

Ala Ala Ala Leu Asn Gly Leu Ala Gly 

290 295 



Ser Asp Pro Tyr Leu Ser Phe 



) ro Leu His Gly Leu Ala Asn Gl 



n Glu Val Leu Leu Trp He Lys Ser 



305 



310 



315 



320 



val val Glu Glu Cys Gly Glu Asn He Ser Lys Glu Gin Leu Lys Asp 

— - 330 



325 
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Tyr Ala Trp Lys Thr Leu Lys Ser Gly Lys Val Val Pro Gly Phe Gly 

340 345 
His Gly Val Leu Arg Lys Thr Asp Pro Arg Tyr Thr Cys Gin Arg Glu 

355 360 
Phe Ala Leu Lys His Leu Pro Glu Asp Pro Leu Phe Gin Leu Val Ala 

370 375 
Lys Leu Tyr Glu Val Phe Leu Gin Phe Leu Gin Asn Leu Ala Lys Leu 
385 390 

,r t 7\ia Hiq qpr Glv Val Leu Leu Asn Tyr 

Asn Pro Trp Pro Asn Val Asp Ala His Ser Giy 

405 410 

Tyr G ly Leu Thr Glu Ala Arg Tyr Tyr Thr Val Leu Phe Gly Val Ser 

420 425 
Arg Ala Leu Gly He Cys Ser Gin Leu He Trp As P Arg Ala Leu Gly 

435 440 
Leu Pro Leu Glu Arg Pro Lys Ser Val Thr Met Glu Trp Leu Glu Asn 



450 



His Cys Lys Lys Ala 
465 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AAGTGGATCC ATGGT GTTTT TCCGCAGCGT AT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide 



32 



(iii) HYPOTHETICAL: YES 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CAT AG GAT C C TTAAGCAGAT GAAGCTTTCT TA 



32 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii ) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GATCGGTACC ATGTACAGAT GCGCATCGTC T 31 
(2) INFORMATION FOR SEQ ID NO : 1 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

( iii ) HYPOTHETICAL : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTACGGATCC CTTGGTTGCA AC AGCAGC T G A 31 
(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

( iii ) HYPOTHETICAL : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 



+ • 
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CTAGGGATCC ATGTCAGCGA TATTATCAAC AACTAGCAAA AGT 4 3 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATTGGATCC TTAGTTCTTA CTTTCGATTT TCTTTACCAA CTC 4 3 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTAGGGATCC AT GGCT GAT A CAAAAGCAA 2 9 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GATTGGATCC TTAACGCTTG ATATCGCTT 



29 



